SAS-Pro: Simultaneous Residue Assignment and Structure Superposition for Protein Structure Alignment
نویسندگان
چکیده
Protein structure alignment is the problem of determining an assignment between the amino-acid residues of two given proteins in a way that maximizes a measure of similarity between the two superimposed protein structures. By identifying geometric similarities, structure alignment algorithms provide critical insights into protein functional similarities. Existing structure alignment tools adopt a two-stage approach to structure alignment by decoupling and iterating between the assignment evaluation and structure superposition problems. We introduce a novel approach, SAS-Pro, which addresses the assignment evaluation and structure superposition simultaneously by formulating the alignment problem as a single bilevel optimization problem. The new formulation does not require the sequentiality constraints, thus generalizing the scope of the alignment methodology to include non-sequential protein alignments. We employ derivative-free optimization methodologies for searching for the global optimum of the highly nonlinear and non-differentiable RMSD function encountered in the proposed model. Alignments obtained with SAS-Pro have better RMSD values and larger lengths than those obtained from other alignment tools. For non-sequential alignment problems, SAS-Pro leads to alignments with high degree of similarity with known reference alignments. The source code of SAS-Pro is available for download at http://eudoxus.cheme.cmu.edu/saspro/SAS-Pro.html.
منابع مشابه
Multiple protein sequence alignment from tertiary structure comparison: assignment of global and residue confidence levels.
An algorithm is presented for the accurate and rapid generation of multiple protein sequence alignments from tertiary structure comparisons. A preliminary multiple sequence alignment is performed using sequence information, which then determines an initial superposition of the structures. A structure comparison algorithm is applied to all pairs of proteins in the superimposed set and a similari...
متن کاملStrategies of non-sequential protein structure alignments.
Due to the large number of available protein structure alignment algorithms, a lot of effort has been made to define robust measures to evaluate their performances and the quality of generated alignments. Most quality measures involve the number of aligned residues and the RMSD. In this work, we analyze how these two properties are influenced by different residue assignment strategies as employ...
متن کاملHelix Segment Assignment in Proteins Using Fuzzy Logic
The automatic assignment of protein secondary structure from three dimensional coordinates is an essential step in the characterization of protein structure. <span style="font...
متن کاملProtein Structure Alignment Using a Graph Matching Technique
This paper proposes new algorithms for protein structure alignment. Protein structure alignment is, given two three-dimensional protein structures, to nd spatially equivalent residue pairs. Each algorithm consists of the following two steps: rst an initial superposition is computed; then a structure alignment is computed and re ned using bipartite graph matching. The proposed algorithms are sho...
متن کاملData Dp Rand Frag Data 1 Data 2 Rmsd Len Time Rmsd Len Time Rmsd Len Time
This paper proposes new algorithms for protein structure alignment. Protein structure alignment is, given two three-dimensional protein structures, to nd spatially equivalent residue pairs. Each algorithm consists of the following two steps: rst an initial superposition is computed; then a structure alignment is computed and re ned using bipartite graph matching. The proposed algorithms are sho...
متن کامل